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Abstract The statistics of records in sequences of independent, identically 
distributed random variables is a classic subject of study. One of the earli- 
est results concerns the stochastic independence of record events. Recently, 
records statistics beyond the case of i.i.d. random variables have received 
much attention, but the question of independence of record events has not 
been addressed systematically. In this paper, we study this question in de- 
tail for the case of independent, non-identically distributed random variables, 
specifically, for random variables with a linearly moving mean. We find a rich 
pattern of positive and negative correlations, and show how their asymptotics 
is determined by the universality classes of extreme value statistics. 
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1 Introduction 

The interest in studying record events in a general sense is evident as they 
by definition tend to be rare events of a special nature. As examples of 
record-breaking events most readers might think about athletic records like 
a world record in 100 m dash, or long jump [1,^^. However records also play 
an important role in other naturally occurring and man-made phenomena. In 
particular, climatic records attract much attention and are frequently men- 
tioned in the media in the context of global warming. The statistics of record 
temperatures, precipitation amounts and floods have been adressed exten- 
sively in recent work [3ll4l[5l[6l[7l[8ll9lfT0] . Also in evolutionary biology record 
events have a certain relevance as was pointed out in PTIIT^IT^ . and they 
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play an important role in the dynamics of disordered systems like spin glasses 
and dirty superconductors [H1IT5] . Last but not least records can be stud- 
ied in the financial sciences, e.g. when considering record-breaking values or 
changes in stock prices }16| . In all these examples not only the rate of record 
events is of interest but also their correlations: Given that a record flood just 
occurred, can one relax because it will take a while before the next record 
flood is to be expected? Based on our previous work in |T7], this question 
will be addressed in the present paper. 

Most previous studies of record events and record values [T0l[T8lfT9l[20] 
have been restricted to the case where the underlying random variables (RV's) 
{Xi} are independently drawn from a common distribution f{x) (i.i.d. RV's). 
In this case, record events can be shown to be stochastically independent, 
i.e. the knowledge about previous record events does not indicate whether 
a future record event is more or less likely. However in many applications, 
the underlying random variables are not i.i.d. For example, in recent work 
Majumdar and Ziff computed the statistics of records in a symmetric ran- 
dom walk [21J (see also [l6l[22l[23] ). Here, record events are obviously not 
independent: In a random walk the probability for a second record to occur 
immediately after after a previous record is simply 1/2, whereas the uncon- 
ditional probability for a record in the TV'th step is « [?!] . 



In this article we consider correlations between records in sequences of 
RV's that are independent but not identically distributed. An important ex- 
ample is the statistics of hot or cold records in weather data. In j7j the occur- 
rence of record-breaking temperatures in European and American weather 
recordings was analyzed by modeling daily temperature measurements as 
uncorrelated RV's with a linear drift. It was shown that the increase of hot 
records and the decrease of cold records due to global warming can be de- 
scribed and quantified by this model. We will refer to this particular example 
of non-i.i.d. RV's, which was originally introduced in the 1980's in the context 
of athletic records [Mll^ . as the Linear Drift Model (LDM). 

The precise definition of the LDM is as follows: For a positive constant 
c, each random variable Yi has an i.i.d. part Xi with density f{x) and an 
additive linear drift such that 



In other words, while the shape of the density fi{y) of the Z*'' RV is invariant, 
its overall position is shifted by a constant amount c each time a new RV 
is drawn. As was already noted in |17) . the record events under this model 
are not in general independent (see also [26] for the related case of RV's 
with changing variance). Below we shall see that the sign and magnitude of 
correlations depend in a highly non-trivial way on the underlying probabil- 
ity density f{x). To quantify the correlations, consider the joint probability 
Pn,n-i of a record occurring in the {N — 1)*'' as well as in the A^*'* step. 




Yi=Xi+ cl. 



(1) 



Pn,n-i = Prob[Y/v record and Yat-i record]. 



(2) 



To normalize this joint probability, we divide by the probabilities pn and 
Pn-1 of records in the N*'^ and A^ — 1*'' steps, obtaining the key quantity 
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Fig. 1 Correlations between record events in the 7**^ and S*** RV for difTerent 
distributions of the underlying i.i.d. part. For the Gumbel case, the well known 
stochastic independence of record events is found, while simulations for the other 
distributions show that generally the records are correlated. Note in particular 
the curve for the symmetric Levy-stable distribution with index p = 1.3, which 
indicates an attraction between record events. 



under consideration throughout this paper, 

iN,N-i\c) = . (3) 

PnPn-i 

The simulations shown in Fig. [T] illustrate two known results, the fact that 
record events for i.i.d. RVs are stochastically independent [I8lll9l[20] (all 
curves meet at unity for c = 0), and that for the LDM with Gumbel dis- 
tributed i.i.d. part, record events are stochastically independent for all val- 
ues of c [TfllT^I^I^ . However the curves for other distributions show that 
stochastic independence does not hold in general. For some choices of the 
underlying probability density f{x) record events tend to repel each other 
{liq,N-i{c) < 1), wheras in other cases the correlations are positive, i.e. record 
events attract each other {lN,N-iic) > 1). In particular this last, somewhat 
counterintuitive numerical finding triggered our interest in this problem. 

In the next section we will derive a general expression for the correla- 
tions between record events as quantified by In^n-i{c). For this we employ 
methods similar to those used in [T7]. In Section [31 we provide some explicit 
examples illustrating the rich and counterintuitive behavior of the correla- 
tions. In particular, we discuss the behavior of correlations with respect to 
the three classes of extreme value statistics [10','27''28','29! . We find that while 
in the WeibuU class of distributions with finite support the correlations are 
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generally negative, positive correlations mainly arise when the RV's are in 
the Frechet class of distributions with power law tails. In the intermediate 
Gumbel class of distributions with exponential-like tails correlations can be 
both positive and negative. Finally in Section 21 we summarize our results 
and discuss possible applications and open questions. 



2 General theory 

Consider a sequence {Xi} of independent but not necessarily identically dis- 
tributed RV's, with fi{x) denoting the probability density of Xi. The proba- 
bility that the maximum of the first N RV's is smaller than a given value x 
then factorizes into the probabilities of the RV's individually being less than 

X, 

Prob[max{Xi,...,Xjv} < A ^\{ fi{x') = \{Fi{x), (4) 

i=i-' 1=1 

where Fi{x) = dx' fi{x') denotes the cumulative distribution function for 
the density fi{x), and the omitted lower (upper) integral boundary is un- 
derstood to be the lower (upper) bound of the support of // respectively. 
Throughout we assume distribution functions to be continuous. 

Using Eq.|3]the probability pN that the N*^ RV is a record can be written 

as 

. JV-l 

Pn 

1=1 



I dxfNix) n Fi{x), (5) 
1=1 

and the joint probability that both X^^i and Xn are records is 



dxwfNixN) / da;Ar_i/jv-i(a;Ar_i) Y[ Fi{xn-i)- (6) 

1=1 



For the LDM we have fi{y) = f{y — cl) and Fi{y) = F{y — cl), where f{x) 
and F(x) are the density and distribution function of the i.i.d. part Xi in 
(H]). It then follows that [T7] 

. N-l N-l 

PN= dy fiy - cN) [] F{y ~ cl) = / da; f{x) J] F(x + cl) (7) 

1=1 '' 1=1 

and similarly 

/ry+c N — 2 

dyf{y) J dxf{x) n F{x + d). (8) 
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2.1 Explicit stochastic independence for two cases 

The expressions ^ and ([8]) allow us to verify two known results: Both for 
the i.i.d. case (c = 0) and for the LDM with Gumbel distributed i.i.d. part 
Xi, record events are independent. 

We start with the i.i.d. case. First note that the probability that the A^*^ 
RV is a record is the probability that it is the largest among N equivalent 
RV's, so by symmetry pn{c = 0) = 1/N. For the i.i.d. case, the subscripts in 
([S]) can be dropped. Using the substitution u = F{x) and du = f[x)dx^ one 
obtains 



Pn,n-i{c = ^) = y J du'u'^"^ = ^ _ ^ — = pnPn-1- (9) 



It is possible to insert an arbitrary spacing between the two record events 
and still obtain this factorization to yield pM,N~j = PnPn-j- Similarly, one 
can consider more than one record event. The factorization property implies 
that in the i.i.d. case, record events are stochastically independent P^SlfTOl 
[20]. 

Next we consider the LDM with Gumbel distributed i.i.d. part, 

f{x) = exp (-e-- - x) (10) 
and F{x) — exp (— e^^). Using that in this case [57] 

F{x + a) = exp (-e""-") = exp (-e'")'"° = F{xY'^ , (11) 
the substitution u = F{x) used above yields [TTll^ 

p^{c) = / dxfix) n F{xr = / duj:.^^ " = ^ (12) 

1=1 -'0 1^1=0 ^ 

with a = e~'^ , and 



Pn,n-\{c)^ \ \ du'u ^'=1 " = — ^^^^2 — / du u^'=i 



1 



■PN-lPN- (13) 



Again one can introduce an arbitrary number of spaces and intermediate 
records and still have the corresponding joint probability factorize. This im- 
plies that for the LDM with Gumbel distributed i.i.d. part, as for arbitrary 
distributions without drift, record events are stochastically independent. 
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Fig. 2 Correlations between record events as the number of RV's A'' increases. 
For the i.i.d. part drawn from a Gaussian distribution with variance a — 1, the 
correlations are negative and for sufficiently large TV tend to a limiting form. For 
the symmetric Levy distributed i.i.d. part, the correlations are positive and increase 
with N. 



2.2 Expansion for small drift 

As shown above, for the i.i.d. case c = 0, record events are independent. On 
the other hand, for very large drift, almost every RV is a record, and thus 
trivially In,n-i{c) — )■ 1 for c — >■ 1 (see Fig. [T]). The behavior of In,n-i{c) in 
between these two limits is not as easily characterized. In particular, while 
for some probability densities the records seem to repel each other for inter- 
mediate c (jN.N-iic) < 1), for other probability densities the records seem 
to attract {In,n-i{c) > 1). 

Figure [2] shows examples for both types of behavior, illustrating in addi- 
tion how the correlations evolve with increasing N for the cases of Levy and 
Gauss distributed i.i.d. parts. Clearly, the correlations are not a finite size 
effect, but their A^-dependence is markedly different in the two cases. For the 
Gaussian case, the correlations are negative and seem to approach a form 
independent of N. For the Levy case, on the other hand, the correlations 
seem to increase with N. We will argue later that the correlations eventually 
become A'^-independent also for heavy-tailed distributions, but this limit is 
approached much more slowly than in the Gaussian case. 

A first step towards understanding the behavior of the correlations is to 
determine the sign with which the curves in FigsH] and [5] depart from the 
i.i.d. case c = 0. To address this question, we compute the record rate pn{c) 
and the joint probability pn.n-i{c) in the limit of small c and cja <Si N~^, 
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where a is the standard deviation or some other measure of the width of the 
distribution. For the record rate this problem was discussed extensively in 
[T7] (see also [7]). Knowing that pn{c = 0) = ^/N, a Taylor expansion for 
small c yields 

PN + c ^ ^ > IiN - 2), . (14) 

where we have defined the quantitjQ 

I{N)= f dxf (x) (x) . (15) 



In the same small c regime we can also easily determine the joint probability 
of occurrence of two consecutive records pn.n-i (c). To leading order in c, 
the expansion of Eq.® yields 

PN.N-i (c) « J dyf (y) J' dxf {x) {x)+cj dyf {y) F^"-^ [y] 

The first term can be evaluated by partial integration to -j^^ziYj^ ^^'^ another 
partial integration yields 

(17, 

+ c '"-"^'"-^' /(iV-3). (18) 

In addition to the zeroth order term , we find a positive and a negative 

correction term that describe the effect of the drift. Depending on which one 
of these two terms is larger, positive or negative correlations will result. 

A similar expansion can be set up for the normalized correlation Zjv,Af-i(c) 
defined in Eq.Q. Writing 

In,n~i{c)^1 + c.J{N) + 0{c^), (19) 

a straightforward computation yields 

J{N) = N{N - 1)I{N - 2) + ^(^- _ 3) _ _ 2)] 

-^^^^^HN - 2) - li^^^im^KiV - 3) (20) 

Again, both positive and negative terms occur in (201), the relative magni- 
tude of which depends explicitly on the underlying probability density f{x). 
Thus from (j20l) . it is not at all obvious how the counterintuitive positive 

^ No te that this de finit ion differs slightly from that of the related quantity /„ used 
in [T7] . The integral (|15p also appears in the analysis of the density of near-extreme 
events l30l. 
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correlations observed numerically arise, how their occurrence depends on the 
properties of / and how the clustering of record events behaves as a function 
of the number TV of RV's considered. Before turning to the explicit evalu- 
ation of J{N) in Section |31 we provide some heuristics on the asymptotics 
of the correlations in the limit of very large iV, and address the behavior of 
correlations between RV's that are more than one time step apart. 

2.3 Asymptotics for large N 

It is clear from Eqs. ([7]) and (|S]) that the asymptotic behavior of the record 
rate pn and the joint probability pn,n-i hinges on the existence of the 
function 

N 

Gc{x)= \im Y[F{x + cj) (21) 

A' — f oo 

i=i 

which was rigorously examined by Ballerini and Resnick. In |24j , they prove 
that the limit d^I]) exists and is nonzero whenever the density f{x) has a finite 
first moment. For completeness we provide a heuristic version of their argu- 
ment. Taking the logarithm of Eq. (PT|) one has to consider the convergence 
of the series 

CO 

ln(G,(a;))=^ln(i^^(x + cj)). (22) 

Now F{x + cj) = Prob(X < a; -I- cj) = 1 — Prob(X > x -I- cj) and for any 
finite X and c > there is a j large enough such that Prob(X > a; -I- cj) <C 1. 
Therefore 

oo 

In {Gc{x)) « s.{x) - Prob(Js: >x + cj), (23) 

where = Sj=i ln(i^(a; + cj) is a finite sum and the logarithms in the 
remaining series were approximated by their Taylor expansion around unity. 
The condition for f{x) to have a finite first moment is that Prob(X > x) 
decays faster than l/x, which implies convergence of the infinite series in 
([23)) and hence of Gc{x) (in the opposite case the series diverges to — oo and 
Gc = 0). Since Gc{x) is a probability, it is bounded from above by unity and 
it follows from ([7]) and ([S]) that pn{c) and pN,N-iic) have finite, non-zero 
limits for ^ oo. The same statement then applies to the ratio ([3]). 

Although the focus of this paper is on the case of a positive drift which 
enhances the occurrence of (upper) records, it is instructive to compare the 
asymptotic behavior for c > described above to the case c < 0. For negative 
drift both the record rate p^r and the joint probability pjv,jv-i vanish for N — > 
oo, and they do so more rapidly than their i.i.d. counterparts. The asymptotic 
behavior of the ratio ^ is then a priori undetermined. We consider a simple 
example where the quantities of interest can be computed explicitly. Let the 
probability density of the i.i.d. variables be given by the negative exponential 
distribution, 

/(x)=e^ x<0 (24) 
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and f{x) =0 elsewhere. Then for c < the integrand in ^ and (|8]) is of the 
form 



K 



Yl F{x + cl) = exp Kx + -K{K + 1 



1=1 

and integration yields 
1 



, a; < 0, 



(25) 



Pn{c) = ]^exp 



exp 



-N{N~1) 



(26) 

For c < the record rate is suppressed superexponentially, such that the ex- 
pected number of records remains finite for iV — > oo (see also |17]). However, 
forming the ratio ([3]) we see that 



In,n-i = exp 



-(iV- l)(iV- 2) 



(27) 



which, in contrast to the case of positive drift, grows without bound for 
N oo. We will see in Section [3] that such a behavior is not restricted 
to this specific example. The non-existence of the N oo limit for c < 
suggests that the limiting function 



r(c) = lim In.n~i{c) 

N-^OQ 



(28) 



may not be smooth for c — > 0, as will be explicitly demonstrated in Section 

m 



2.4 Dependence on the distance between record events 

Before embarking on the study of specific distributions, we briefly comment 
on the behavior of the correlations as a function of the distance between 
record events. Denoting hy pN,N-k the joint probability for observing a record 
in the N — fcth and the iVth event, it was shown in [53] that 



lim lim pN,N-k{c) 

k—^oo N—^oo 



N 



lim PAr(c) = p{c) 



(29) 



which implies that record events become uncorrelated for A: — oo . This leads 
to the expectation that in a finite time series, the correlations between record 
events at distance k should decay with k, in agreement with the simulations 
shown in Fig. [31 The correlations are seen to be maximal at fc = 1. In the 
present paper we therefore focus on the case of nearest neighbor correlations, 
although extending the computations to fc > 1 is in principle straightforward. 
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Fig. 3 Correlations between record events at distance k from time series of length 
A'^ = 32 for Gaussian (cr = 1), Levy-stable (n — 1.5), uniform (on [0, 1]) and Pareto 
{fi = 1.5) distributions. Left: Full c dependence of /]v,jv_fc(c). Right: At the drift 
velocity Cmax where the correlations are maximal (cmax ~ 0.1 for Gaussian and 
uniform, Cmax = 0.2 for the Levy and Cmax ~ 0.4 for the Pareto case), /jv,]v-fc(cmax) 
is shown as function of k to illustrate the decay of correlations. 

3 Explicit examples 

Whereas the statistics of record events in sequences of i.i.d. RV's is completely 
universal 18,19,20], in the presence of drift one has to distinguish between 
distributions belonging to the three different universality classes of extreme 
value theory, the classes of Weibull, Gumbel and Frechet [T0ll28ll29) . For 
each of these classes we will analyze the correlations in the LDM for a few 
exemplary distributions, starting with the Weibull class, and summarize the 
observed behavior in a unifying scaling picture in Section 13.41 The results 
presented below rely on the study of the record rate pn (c) in the LDM 
presented in |T7], as well as on the expansions for pjv^jv_i (c) [Eq. (fT7|) ] and 
In,n^i (c) [Eqs. (|19I20|) ] derived above in Section 

3.1 The Weibull class 

The Weibull class is the class of distributions with bounded support. As a 
very simple member of the Weibull class we consider a uniform distribution on 
the interval [—a, a]. We compute In,n-i (c) in the small c regime by making 
use of ([ranUl) and find for iV > 1 

In^n-i{c)~1-^N^. (30) 

The correlations are negative and their magnitude increases rapidly with N 
(but note that the leading order approximation is valid only for cN'^ /2a <C 1). 
We performed numerical simulations to check this result with a = I and 
—0.1 < c < 0.4. Figure ID shows how the descent of In,n-i{c) for c > 
steepens with increasing N. In the figure we also plot our prediction (15(11) for 
small values of N, however the steepness of the descent at zero does not allow 
us to estimate the quality of our approximation reliably. The figure shows 
that for large enough TV and not too small c, Ij^^^-i (c) eventually becomes 
independent of N, as expected from the general arguments given in Section 
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Fig. 4 Left: Numerical simulations of In,n-i{c) for a uniform distribution of 
width a = 1. For each c and A*' we simulated 10^ time series. The figure shows data 
for N = 4,16 and 64. At c = we find a stee p de scent of In,n-i{c). For A'^ = 4 
and A'' = 16 we show the analytic prediction (|30|l (steep lines). With increasing 
c, In,n-i{c) appears to become completely independent of A''. Right: Simulation 
results for /iv,jv-i as a function of A^ for c = -0.01, -0.001, 0, 0.001 and 0.01. We 
simulated lO'' series of 100 RV's in each case. For c < we find a ver y st eep increase 
of In,n-i that is in good agreement with the analytical prediction pO|l . For c > 0, 
In,n-i quickly saturates at a constant value and our analytical results are not very 
useful. 



12.31 In this case the approach to the — > oo limit is particularly rapid, 
because distributions that are more than 2a/c steps apart do not overlap. 
The sharp initial drop of In^n-i{c) indicates that the limiting distribution 
I* (c) is discontinuous at c = 0, which is consistent with the non-existence of 
a limiting functior0 for c < 0. 

As a more general subset of the WeibuU class we next consider the class 
of distributions defined by 



f^{x)^ai-xf-' (31) 

with if > and < a; < 1. For if = 1 this produces a uniform distribution 
similar to the one used above. In |T7] we found that for ^ > i the integral in 
(fT5)) is given by 



r (2~ ^) r{N + 1) 
HN) = g \ (32) 



Using this and assuming ^ 1 we find the following expression for In^n-i (c): 

^^3/^(2- i)r(A^-i) 

^ With regard to its upper tail, the negative exponential distribution (|24|l is 
equivalent to the uniform distribution, and therefore the argument of Section [2.31 
applies here as well. 
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Fig. 5 Numerical Simulations ol In,n-i{c) for an exponential distributions with 
unit mean. For each c and A'^ we simulated 10® time series. The figure shows results 
for A'^ = 4, 16 and 64 along with our Af- independent analytical prediction pS)) . 



and making use of the Stirling approximation we finally obtain 

In,n-i (c) « 1 - |r (^2 - 1^ N'+l . (34) 

Similar to the uniform case, the correlations between neighboring record 
events are always negative and their magnitude increases rapidly with in- 
creasing N, suggesting a discontinuity of the limiting function l*{c) at c = 0. 



3.2 The Gumbel class 

As a first example in the Gumbel class we consider the exponential distribu- 
tion with mean a, / {x) = a~^e~'^ , a: > 0. In [171 it was found that in this 
case the effect of the linear drift on the record rate is independent of N to 
leading order in c. Using this result it is straightforward to obtain 

In,n-i{c)^1 + ^. (35) 

We compared this result to numerical simulations in Fig. [S] and found a very 
good agreement. Apparently Zjv,7v-i (c) assumes a value independent of N 
already for very small N. 

Another important member of the Gumbel class is the normal distribution 
with mean zero and standard deviation a. Here, the computation is usually 
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Fig. 6 Numerical simulations of In,n-i{c) for a Gaussian distribution of width 
a = 1. Left figure shows In,n-i{c) for TV = 4, 16 and 64. For each c and we 
simulated 10^ time series. The figure illustrates the steep descent of the correlator 
at c = when c is small. Right figure shows In,n-i{c) for different, fixed values of 
c together with our analytical results. Here, we analyzed 10^ series of RV's for each 
drift rate. Again, we find agreement between the simulations and our analytical 
computations for small and c. We manually fitted curves oc 7r). 
Interestingly for c < the agreement is a lot better and In,n-i{c) does not saturate 
at a constant value. 



a bit more complicated; however, as in the previous examples, most of the 
vifork was already done in [T7] . There we found that for ^ 1 and cN ^ a 



^(»)-^¥,A«fS. (36) 



Using (I^Ul) this result allows us to estimate the effect of the drift on the 
correlations. We find 



. c 



{lN.N-i{c)-l)<x-N^^\n(^^y (37) 

Unlike the case of the exponential distribution, in the Gaussian case the 
correlations are negative and depend strongly on N. The behavior is sim- 
ilar to that found in the WeibuU case, and matches the result (up to 
logarithimic factors) for ^ — ^ oo. Numerical results for the Gaussian distri- 
bution are shown in Fig. [S] Similar to the uniform distribution, ?jv,Ar-i (c) 
shows a steep descent at c = which becomes steeper for increasing TV. 
However, the descent appears to be smoother than in the uniform case. In 
agreement with the considerations of Section 12.31 for c > and large N the 
correlations become independent of N. In contrast, for c < the correlations 
increase monotonically with N and do not appear to saturate. In the case 
of the Gaussian distribution our analytical prediction p7p is only valid in 
a small regime for small c > and relatively small but for c < the 
approximation is significantly better. 

To understand the marked difference between the exponential and Gaus- 
sian distributions, we analyze the general class of Gumbel-type distributions 
of the form 

/(x)=A^e-l-l', (38) 
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with normalization Ap and /3 > 0. In this case it was found in [17] that 

/ (N) oc iV^^ln (N)^^^^' . (39) 

With some further computations this leads us to the following behavior of 
the leading order correction coefficient J (N): 

J{N) oc ~Di ^1 - Nln{N)^'^ + D2 In(iV)^"^ , (40) 

where Di and D2 are positive constants not depending on N. The exact val- 
ues of Di and D2 are very difficult to compute and we will not consider them 
in this article. Nevertheless, this expression nicely reproduces our results for 
the exponential and the Gaussian distribution and shows how both positive 
and negative correlations may emerge in the Gumbel class. For /? = 1 the 
leading first term vanishes and J{N) reduces to a positive constant. For all 
/? 7^ 1 the first term dominates and the correlations grow with N, with a sign 
determined by 1 — ^ ■ For values /? > 1 of distributions that decay faster than 
the standard exponential we find negative correlations between the records, 
in agreement with the Gaussian example, while for stretched exponential dis- 
tributions (/? < 1) positive, iV-dependent correlations result (see Fig. [5] for 
an example). 

In hindsight, the special role of the exponential distribution should not 
come as a surprise, since the Gumbel distribution (jlOl) (for which correla- 
tions are completely absent) has an exponential tail. In the sense of extreme 
value theory, the exponential distribution is close to the Gumbel, and the 
corresponding records in the LDM are therefore almost uncorrelated (up to 
small residual correlations which are independent of A^). 



3.3 The Frechet class 

As a subset of the distributions in the Frechet class of extreme value statistics 
we consider the well known Pareto distribution 

f{x)=fix-'^-' (41) 

with /i > 1 and x > 1. In |17j it was found that in this case 



r 2 + i r(iv + i) 

liN) = \ (42) 

This allows us to compute In.n-i (c) for the Pareto distribution. We find 
, /-(2 + t)r(iV-l) NiN-l){N^ + ,N + ,) 

which leads to the expression 
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Drift Speed c Number of events N 

Fig. 7 Left: Numerical simulations of In,n-i{c) for Pareto distributions. For 
each c and fj, we simulated 10^ time series of length TV" = 16. The figure shows 
simulations for /i = 2, 3 and 4, along with our analytical predictions for small c. 
At c = we find a steep asce nt o f In,n-i{c) which gets steeper with increasing 
/i. Our analytical result from (|44p is in good agreement with the simulations for 
small enough c. Right: Simulation results for a Pareto distribution with fi = 2 
and different values of c. We analyzed 10^ series of = 100 RV's for each c. Our 
analytical work predicts a square-root behavior of In,n-i{c) — 1, which is in good 
agreement with the simulations. 



In,n-i (c) « 1 + (^2 + N^-f^" (44) 

for large A''. For positive c, we thus expect positive correlations between 
neighbouring record events for all distributions of Pareto form. These corre- 
lations are increasing with N slower than linearly. We expect that, asymp- 
totically, this behavior is universal for all distributions within the Frechet 
clasfl We compare Eq. ([^^ to numerical simulations in Fig. [71 finding good 
agreement for small c, both for the c- and the A^- dependence. The agreement 
improves when the distribution becomes broader for smaller /i. The strength 
of correlations for strongly heavy-tailed distributions of the Pareto class is 
remarkable. For example, for a Pareto distribution with coefficient /i = 2 we 
found correlations In^n-i ~ 3 for large N (see Fig. 

The correlations displayed in the right panel of Fig 17] increase (for c > 0) 
or decrease (for c < 0) with TV without showing any sign of saturation, 
although we know from the general considerations of Section [2. 31 that In,n-i 
must approach an iV- independent limit for c > and /i > 1. As we will see in 
the next subsection, this reflects the fact that the time scale A^* at which the 
limit is attained is (for a given value of c) particularly large for distributions 
of the Frechet class. We also note that because In,n-i < 1 for c < 0, the 
normalized correlations are confined to lie between and 1 in this case. This 
implies that the divergence of lN,N-iic) for N ^ oo and c < 0, which was 
demonstrated in Section 12.31 to occur for a specific example in the WeibuU 
class, cannot happen here. We therefore conjecture that, for distributions in 
the Frechet class, the limit ([28]) exists also for c < 0. 

^ In (TJ we presented data for a Levy stable distribution with fi — 1.8 which show 
negative correlations between record events. We have checked that this behavior is 
not asymptotic, and that the correlations become positive for larger values of A'^. 
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3.4 Unified picture 

The results of the preceding subsections can be summarized in a simple 
scaling picture. We first recall from JT] and that the LDM (with c > 
0) contains a characteristic time scale N* at which the record rate pn{c) 
crosses over from the i.i.d. behavior p^r ~ ;^ to the limiting value p{c) = 
limjv-^oo PAr(c) > 0. For the different distributions discussed above, this time 
scale diverges for c according to 

A^* cx c^TTs WeibuU class (45) 
A^* cx 1 1 In c| ^ " ^ Gumbel class (46) 
A^* cx c"^ Frechet class (^ > 1) . (47) 

Ignoring for the moment the logarithmic factor in the Gumbel case, these 
behaviors can be further simplified by expressing the different universality 
classes in terms of the generalized Pareto distribution [31] 

f{x)^{l + Kx)'^ . (48) 

For K > this is of Pareto type with /i = i, for k < it reduces to a 
Weibull-type distribution similar to pip with £, ~ and the Gumbel 

class is represented by the exponential distribution which arises from (|48)) 
for K — T' 0. Using this representation, the different cases in (45), (46) and 
(47) reduce to 

N* (X c"" with = . (49) 

1 — K 

Note that ^ < 1 in the Weibull class but > 1 in the Frechet class, which 
explains the slow convergence to the iV — > oo limit in the latter case. More- 
over, it was shown in [T7] that the integral ([T5]) behaves for large N as (see 
also 130]) 

I{N) ^ N^-^ = N-<-^+''l (50) 

To relate this to the behavior of the correlations, we note that for large TV 
Ea. (|20p can be approximately written as 

J{N) ~ ~^^''^^(^) - N^HN) + 0{N^IiN)). (51) 

Inserting ([50)) we thus conclude that, to leading order, 

J{N) w |iV3/(iV) - N^^"", (52) 

which correctly reproduces both the sign and the order of magnitude of the 
correlations derived in Section [3] for the Weibull and Frechet classes: Corre- 
lations are positive (negative) for k > (k < 0), and they scale sublinearly 
(superlinearly) with N in the two cases. A similar calculation using the re- 
fined estimate (l39l) of I{N) for the Gumbel class shows that the correlations 
are negative (positive) for /3 > 1 (/? < 1), in agreement with (|40)) . 
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Fig. 8 Numerical simulations of In.n-i{c) with c = 0.05 for six different distribu- 
tions. In each case we averaged over 10*^ time series of length A'^ = 10^. The results 
for In,n-i{c) were binned logarithmically in order to improve the averaging for 
large A'^. For the uniform and the Gaussian distribution /jv,jv-i(c) is clearly nega- 
tive for c > and it decreases with growing A'^. At some TV = A^* the correlations 
become independent of A^. For the special case of the exponential distribution we 
find a constant value of Zjv,jv-i(c) which is only slightly larger than unity. Both the 
stretched exponential (with /3 — 1/2) and the Pareto distributions (with fi = 2,3, 4) 
show positive correlations. Note the slow convergence for the Pareto distribution 
with fj. = 2. 



4 Conclusion 



In this paper we analyzed the effect of a linear drift on the correlations 
between records drawn from series of independent RV's, as quantified by 
the normalized joint probability Z^v.at-i (c). In Section 2 we derived general 
expressions (exact and approximate) for In^n-i (c) and recalled the fact that 
records are independent both for i.i.d. RV's (c — 0) and in the special case 
where the RV's are drawn from the Gumbel distribution. 

Our main analytic results were obtained in Section 3 by way of an ex- 
pansion in the small c limit, similar to the approach developed previously 
in [17j. Using this approach we were able to show that the correlations are 
generally negative ('repulsive') for distributions in the Weibull class, and pos- 
itive ('attractive') for distributions in the Frechet class. In the Gumbel class 
the sign of the correlations depends on the stretching exponent (3 in (l38)) . 
with the border between positive and negative correlations being given by 
the exponential case /3 = 1. In contrast to all other cases, for distributions 
with an exponential tail the correlations are weak and independent of N, 
which is consistent with the fact that this class of distributions also contains 
the Gumbel distribution, which has no correlations at all. Simulation results 
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illustrating the different cases are summarized in Fig. \8\ A special role of the 
exponential distribution in separating two regimes of qualitatively different 
behaviors has been noted previously in the related context of near-extreme 
events [3D]. 

Perhaps the most surprising and counterintuitive outcome of our work is 
the discovery of strong positive record correlations for distributions with a 
power law taiQ. In view of the substantial interest in detecting and explaining 
heavy-tailed distributions in all areas of science , our finding suggests 

that drift-induced record correlations could be used as a distribution-free test 
for detecting power laws or streched exponentials in empirical data [55] . 

An interesting open question concerns the structure of the record corre- 
lations in the asymptotic limit N ^ oo, where the record rate approaches a 
nonzero constant and the record process thus becomes stationary. Based on 
the work of Ballerini and Resnick [23, we have argued in Section [^?51 that the 
limit ([25]) exists for c > whenever the underlying distribution has a finite 
mean, whereas for c < it is possible that lN,N-iic) diverges for N — >■ oo. 
The fact that the coefficient of the leading order term in the small c expan- 
sion generally diverges with N indicates that the limiting function l*{c) may 
be singular for c — ?> 0, and we have presented numerical evidence for the 
occurrence of a discontinuity at c = for the WeibuU class. For heavy-tailed 
distributions in the Frechet class, the large- A'^ asymptotics is difficult to as- 
certain numerically because of the slow convergence, but we have argued that 
in this case r{c) may exist also for c < 0. Rigorous work addressing these 
questions along the lines of would be most welcome. 
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